PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information
TF ID Neem_178_f_1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Sapindales; Meliaceae; Azadirachta
Family Trihelix
Protein Properties Length: 446 aa    MW: 51270.5 Da    pI: 7.0143
Description Trihelix family protein
Gene Model
Gene Model ID  Type    Source
Neem_178_f_1   genome  NGD
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    trihelix  96.9   1.8e-30  258    342  1          86
      trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86 
                   rW+k+ev aLi++r  +e+r+ +  lk+plWeevs+ m++ g++rs+k+Ckekwen+nk+++k ke+ kkr s++s+tc yfdql+
  Neem_178_f_1 258 RWPKDEVEALIQVRIGLESRFLEPGLKGPLWEEVSSLMASMGYQRSAKRCKEKWENINKYFRKAKETGKKR-SPRSKTCTYFDQLD 342
                   8*********************************************************************8.77788********8 PP

Protein Features
3D Structure
Database         Entry ID  E-value   Start  End  InterPro ID  Description
Pfam             PF13837   4.0E-21   257    344  No hit       No description
PROSITE profile  PS50090   6.934     257    315  IPR017877    Myb-like domain
CDD              cd12203   7.32E-25  257    322  No hit       No description
Sequence
Protein Sequence    Length: 446 aa
MLRGNIYLNR CNNKQCDEDK APDIECNKNK KHYNHTLKQV DQRKERCKNI GTKYTQFSEL  60
EAVCNLANGK IFETSGSDLT GDKSPKDAGA VFLMSLRDMQ CTNTCEMADA DAELGSENSI  120
EEACLGKVNE NKKRKRKLKE NYSGMIEFFK CLVQQLMDHQ EGLHRKYLEA VHRMDKERAE  180
REEKWRQQET EKHNREAIAR AHEQSIASNR EDQLISLIQK ITGRSINLPP RKSALLLQPQ  240
LTKEQTKELT GMKGETNRWP KDEVEALIQV RIGLESRFLE PGLKGPLWEE VSSLMASMGY  300
QRSAKRCKEK WENINKYFRK AKETGKKRSP RSKTCTYFDQ LDQLYSRTPL NLPSSSSNPA  360
FDSDIEQQNQ GYSELLEAFA AERDHLGIAQ NTSTAGNFDV FEMGSLRLNF DGIPNNQTIE  420
FEQGRHGNEN KDCVEEHKVD GEQQVE
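
The Start and End values reported for the trihelix signature domain (258 and 342) can be checked against the protein sequence listed above. The following is a minimal sketch, assuming those coordinates are 1-based and inclusive; the helper names `clean` and `slice_1based` are illustrative and are not part of PlantTFDB.

```python
import re

# Protein sequence for Neem_178_f_1, copied from the record as displayed.
# The grouping spaces and running position counters are left in on purpose
# and removed by clean() below.
RAW = """
MLRGNIYLNR CNNKQCDEDK APDIECNKNK KHYNHTLKQV DQRKERCKNI GTKYTQFSEL  60
EAVCNLANGK IFETSGSDLT GDKSPKDAGA VFLMSLRDMQ CTNTCEMADA DAELGSENSI  120
EEACLGKVNE NKKRKRKLKE NYSGMIEFFK CLVQQLMDHQ EGLHRKYLEA VHRMDKERAE  180
REEKWRQQET EKHNREAIAR AHEQSIASNR EDQLISLIQK ITGRSINLPP RKSALLLQPQ  240
LTKEQTKELT GMKGETNRWP KDEVEALIQV RIGLESRFLE PGLKGPLWEE VSSLMASMGY  300
QRSAKRCKEK WENINKYFRK AKETGKKRSP RSKTCTYFDQ LDQLYSRTPL NLPSSSSNPA  360
FDSDIEQQNQ GYSELLEAFA AERDHLGIAQ NTSTAGNFDV FEMGSLRLNF DGIPNNQTIE  420
FEQGRHGNEN KDCVEEHKVD GEQQVE
"""


def clean(raw: str) -> str:
    """Drop whitespace and position counters, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def slice_1based(seq: str, start: int, end: int) -> str:
    """Return residues start..end, treating both bounds as 1-based inclusive."""
    return seq[start - 1:end]


seq = clean(RAW)
domain = slice_1based(seq, 258, 342)

print(len(seq))     # total number of residues
print(len(domain))  # number of residues in the domain span
print(domain)       # compare with the Neem_178_f_1 row of the alignment
```

The printed domain string can be compared with the Neem_178_f_1 row of the signature domain alignment once the gap character (`-`) is removed from that row.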
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    131    136  KKRKRK
Binding Motif
Motif ID  Method  Source
MP00651   PBM     Transfer from LOC_Os02g01380
Annotation -- Protein
Source  Hit ID              E-value  Description
Refseq  XP_002324189.2      1e-126   hypothetical protein POPTR_0018s05520g
TrEMBL  B9IKD4              1e-126   B9IKD4_POPTR; Uncharacterized protein
STRING  POPTR_0018s05520.1  1e-126   (Populus trichocarpa)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM16469813
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G76890.2  5e-47    Trihelix family protein